Website Term Browser: Overcoming language barriers in text retrieval

نویسندگان

  • Anselmo Peñas
  • M. Felisa Verdejo
  • Julio Gonzalo
چکیده

Current search systems fail to satisfy users when the relevant information is written in a foreign language; when the user is not aware of the relevant -perhaps specialized terminology for a given topic; or when the user need is fuzzy and requires assisted search once inside an appropriate web portal. This paper describes an interactive multilingual search system that alleviates such limitations, through the browsing of phrases in different languages after being automatically extracted from the text collection. The evaluation of WTB has been focussed in two aspects: the capability to offer translingual terminology to users, and the usefulness of phrase browsing. In this sense, the evaluation shows that users consider the new level of terminological information useful, as it complements the traditional document ranking outcome.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Using Text Surrounding Method to Enhance Retrieval of Online Images by Google Search Engine

Purpose: the current research aimed to compare the effectiveness of various tags and codes for retrieving images from the Google. Design/methodology: selected images with different characteristics in a registered domain were carefully studied. The exception was that special conceptual features have been apportioned for each group of images separately. In this regard, each group image surr...

متن کامل

Overcoming barriers to NLP for clinical text: the role of shared tasks and the need for additional creative solutions

This issue of JAMIA focuses on natural language processing (NLP) techniques for clinical-text information extraction. Several articles are offshoots of the yearly ‘Informatics for Integrating Biology and the Bedside’ (i2b2) (http://www.i2b2.org) NLP shared-task challenge, introduced by Uzuner et al (see page 552) and cosponsored by the Veteran’s Administration for the last 2 years. This shared ...

متن کامل

The UCSC Table Browser data retrieval tool

The University of California Santa Cruz (UCSC) Table Browser (http://genome.ucsc.edu/cgi-bin/hgText) provides text-based access to a large collection of genome assemblies and annotation data stored in the Genome Browser Database. A flexible alternative to the graphical-based Genome Browser, this tool offers an enhanced level of query support that includes restrictions based on field values, fre...

متن کامل

Natural Language Information Retrievaltrec - 6 Report

Natural language processing techniques may hold a tremendous potential for overcoming the inadequacies of purely quantitative methods of text information retrieval, but the empirical evidence to support such predictions has thus far been inadequate, and appropriate scale evaluations have been slow to emerge. In this chapter, we report on the progress of the Natural Language Information Retrieva...

متن کامل

Evaluating Natural Language Processing Techniques in Information Retrieval: a Trec Perspective

Natural language processing techniques may hold a tremendous potential for overcoming the inadequacies of purely quantitative methods of text information retrieval, but the empirical evidence to support such predictions has thus far been inadequate, and appropriate scale evaluations have been slow to emerge. In this chapter, we report on the progress of the Natural Language Information Retrieva...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Journal of Intelligent and Fuzzy Systems

دوره 12  شماره 

صفحات  -

تاریخ انتشار 2002